A Structured Text ADT for Object-Relational Databases
Authors
Abstract
There is a growing need, both for use within corporate intranets and within the rapidly evolving World Wide Web, to develop tools that are able to retrieve relevant textual information rapidly, to present textual information in a meaningful way, and to integrate textual information with related data retrieved from other sources. This paper introduces a model for structured text and presents a small set of operations that may be applied against this model. Using these operations, structured text may be selected, marked, fragmented, and transformed into relations for use in relational and object-oriented database systems. The extended functionality has been accepted for inclusion within the SQL/MM standard, and a prototype database engine that supports SQL with extensions to incorporate the proposed text operations has been implemented. This prototype serves as a proof of concept intended to address industrial concerns, and it demonstrates the power of the proposed abstract data type for structured text.
Similar resources
Array-Based Evaluation of Multi-Dimensional Queries in Object-Relational Database Systems
Since multi-dimensional arrays are a natural data structure for supporting multi-dimensional queries, and object-relational database systems support multi-dimensional array ADTs, it is natural to ask if a multi-dimensional array-based ADT can be used to improve O/R DBMS performance on multi-dimensional queries. As an initial step toward answering this question, we have implemented a multi-dimen...
An Object-Oriented View Onto Public, Heterogeneous
Even though companies maintain highly-structured, traditional business data in relational databases, large amounts of information are available in semi-structured text sources, such as indexed online newspapers, patent information, literature citations or business profiles. This information is offered by commercial providers who maintain complete control over access language, schemas and update ...
Extending SGML to Accommodate Database Functions: A Methodological Overview
* Partially supported by US Dept. of Education award number P200A502367 and NSF Research and Infrastructure grant, award number NSF CDA-9303189. Abstract A method for augmenting an SGML document repository with database functionality is presented. SGML [ISO 8879, 1986] has been widely accepted as a standard language for writing text with added structural information that gives the text greater ...
Declarative Information Extraction in a Probabilistic Database System
Full-text documents represent a large fraction of the world’s data. Although not structured per se, they often contain snippets of structured information within them: e.g., names, addresses, and document titles. Information Extraction (IE) techniques identify such structured information in text. In recent years, database research has pursued IE on two fronts: declarative languages and systems f...
E-ADTs: Turbo-Charging Complex Data
The next generation of database applications will be dominated by rich and complex data types. The ADT technology of today's object-relational database systems cannot provide adequate performance for such applications. The basic stumbling block is the lack of semantic knowledge provided to the database system about the ADT operations. We are developing novel "Enhanced ADT" (E-ADT) technology th...
Journal title:
- TAPOS
Volume 4, issue
Pages -
Publication date: 1998